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Abstract 



Considering that a trader or a trading algorithm interacting with markets during continuous 
auctions can be modeled by an iterating procedure adjusting the price at which he posts orders at a 
given rhythm, this paper proposes a procedure minimizing his costs. We prove the a.s. convergence 
of the algorithm under assumptions on the cost function and give some practical criteria on model 
parameters to ensure that the conditions to use the algorithm are fulfilled (using notably the co- 
monotony principle). We illustrate our results with numerical experiments on both simulated data 
and using a financial market dataset. 
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1 Introduction 

In recent years, with the growth of electronic trading, most of the transactions in the markets occur in 
Limit Order Books. During the matching of electronic orders, traders send orders of two kinds to the 
market: passive {i.e. limit or patient orders) which will not give birth to a trade but will stay in the 
order book (sell orders at a higher price than the higher hid price or buy orders at a lower price than 
the lower ask price are passive orders) and aggressive orders [i.e. market or impatient orders) which 
will generate a trade (sell orders at a lower price than the higher passive buy price or buy orders at a 
higher price than the lowest passive price). When a trader has to buy or sell a large number of shares, 
he cannot just send his large order at once (because it would consume all of the available liquidity in 
the order book, impacting the price at his disadvantage); he has first to schedule his trading rate to 
strike balance between the market risk and the market impact cost of being too aggressive (too many 
orders exhaust the order book and makes the price move). Several theoretical frameworks have been 
proposed for optimal scheduling of large orders (see [3], [7], [24], [2]). Once this optimal trading rate 
is known, the trader has to send smaller orders in the (electronic) limit order book by alternating 
limit (i.e. patient or passive) orders and market {i.e. urgent or aggressive) orders. The optimal mix of 
limit and market orders for a trader has not been investigated in the quantitative literature even if it 
has been studied from a global economic efficiency viewpoint (see for instance [9]). It has not either 
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been investigated from the viewpoint of one trader trying to optimize his own interactions with other 
market participants. One of the difficulties from the trader prospective is that transactions obtained 
by inserting a passive order in a hmit order book is a functional of its distance to the mid-price, giving 
birth to a large number of possible tactics in terms of reassessment of the price of such orders. 

In this paper, we study the optimal distance to submit passive orders in a limit order book, without 
needing a model of the limit order book dynamics (see e.g. [1] or [11] for such models of limit order 
books). 

Optimal submission strategies have been studied in the microstructure literature using utility 
framework and optimal control (see [4], [10], [5], [12] and [13]). The authors of such papers consider 
an agent who plays the role of a market maker, i.e. he provides liquidity on the exchange by quoting 
bid and ask prices at which he is willing to buy and sell a specific quantity of assets. Strategies for 
bid and ask orders are derived by maximizing his utility function. 

Our approach is different: we consider an agent who wants to buy (or sell) during a short period 
[0, T] a quantity Qt of traded assets and we look for the optimal distance where he has to post his 
order to minimize the execution cost. 

We are typically at a smaller time scale than in usual optimal liquidation frameworks. In fact order 
posting strategies derived from the viewpoint presented below can be "plugged" into any larger scale 
strategy. We are modeling the market impact of an aggressive order using a penalization function 
K ■ $(Q), where Q is the size of the market order. 

If a stochastic algorithm approach has been already proposed in [20] by the authors for optimal 
spatial split of orders across different Dark Pools, here the purpose is not to control fractions of the 
size of orders, but to adjust successive posting prices to converge to an optimal price. Qualitatively, 
this framework can be used as soon as a trader wants to trade a given quantity Qt over a given time 
interval [0, T] with no firm constraint on its trading rate between and T. It is typically the case 
for small Implementation Shortfall benchmarked orders. The trader can post his order very close to 
the "/azr price" {St)t^[o^x] (which can be seen as the fundamental price, the mid price of the available 
trading venues or any other reference price). In this case he will be exposed to the risk to trade too 
fast at a "bad price" and being adversely selected. Conversely he can post it far away from the fair 
price; in that case he will be exposed to never obtain a transaction for the whole quantity Qt, but 
only for a part of it (say the positive part of Qt — Nt, where Nt is the quantity that the trading 
flow allowed him to trade). He will then have to consume aggressively liquidity with the remaining 
quantity, disturbing the market and paying not only the current market price St, but also a market 
impact (say St ^{Qt — -^t) where <I> is a market impact penalization function). 

The approach presented here follows the mechanism of a "learning trader" . He will try to guess 
the optimal posting distance to the fair price achieving the balance between being too demanding in 
price and too impatient, by successive trials, errors and corrections. The optimal recursive procedure 
derived from our framework gives the best price adjustment to apply to an order on given stopping 
time (reassessment dates) given the observed past on the market. We provide proofs of the convergence 
of the procedure and of its optimality. 

To this end, we model the execution process of orders by a Poisson process {Nj^^'')Q<t<T which 
intensity At{6,S) depends on the fair price {St)^^Q and the distance of order submission 6. The 
execution cost results from the sum of the price of the executed quantity and a penalization function 
depending on the remaining quantity to be executed at the end of the period [0,T]. This penalty 
K ■ ^{Q) models the over-cost induced by crossing the spread and the resulting market impact of this 
execution. The aim is to find the optimal distance 5* £ [0, (5max]) where Jmax is the depth of the limit 
order book, which minimizes the execution cost. In practice, the prices are constrained to be on a 
"tick size grid" (see [25]), instead of being on the real line. We will follow the approach of papers 
on market making [14, 4, 10] assuming the the tick size is small enough to not change the dynamics 
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of the price and of the bid-ask spread [22]. Nevertheless, a "rounding effect" would not change the 
nature of our results, since any projection {e.g. to nearest-neighbor) is a non-decreasing transform and 
the CO- monotony principle still holds (by slightly adapting the proofs). This leads to an optimization 
problem under constraints which we solve by using a recursive stochastic procedure with projection 
(this particular class of algorithm is studied in [18] and [19]). We prove the a.s. convergence of the 
constrained algorithm under additional assumptions on the execution cost function. From a practical 
point of view, it is not easy to check the conditions on the cost function. So we give criteria on 
the model parameters which ensure the viability of the algorithm which relies on the co-monotony 
principle assumed to be satisfied by the "fair price" process {St)t^[o^T]- This principle will be detailed 
in the Appendix Section B. We conclude this paper by some numerical experiments with simulated 
and real data. We consider the Poisson intensity presented in [4] and use a Brownian motion to model 
the fair price dynamics. We plot the cost function and its derivative and show the convergence of the 
algorithm to its target 6*. 

The paper is organized as follows: in Section 2, we first propose a model for the execution process of 
posted orders, then we define a penalized execution cost function (including the market impact at the 
terminal execution date). Then we devise the stochastic recursive procedure under constraint to solve 
the resulting optimization problem in terms of optimal posting distance on the limit order book. We 
state the main convergence result and provide operating criteria that ensure this convergence, based 
on a co-monotony principle for one dimensional diffusions. Section 3 establishes the representations 
as expectations of the cost function and its derivatives which allow to define the mean function of the 
algorithm. Section 4 presents the convergence criteria (which ensure that the optimization is well- 
posed) derived from the principle of co-monotony established in Section B of the appendix. Finally 
Section 5 illustrates with numerical experiments the convergence of the recursive procedure towards 
its target. 

Notations. • (x)+ = maxjx, 0} denotes the positive part of x, [x\ = max{A; E N : A; < x} [[0, xj := 



• C{[0,T], A) := {/ : [0,T] — )• A continuous} (equipped with the supnorm topology) and D([0, T], j4) := 
{/ : [0, T] A cadlag} (equipped with the Skorokhod topology when necessary see [15]) where cadlag 
means right continuous with left limits and A = R^, Mj^, etc. They are equipped with the standard 
Borel cr-field a{a ^ a{t),t G [0,r]). 

• P-esssup/ = inf{a G M : P({x : f{x) > a}) = 0}, ||a||^ = supigfo^r] \a{t)\, a G D([0,r],M) and fj, 
denotes the left derivative of /. 

2 Design of the execution procedure and main results 
2.1 Modeling and design of the algorithm 

We focus our work on the problem of optimal trading with limit orders on one security without needing 
to model the limit order book dynamics. We only model the execution flow which reaches the price 
where the limit order is posted with a general price dynamics {St)t(z[o^T] since we intend to use real 
data. However there will be two frameworks for the price dynamics: either {St)t^[o,T] is a process 
bounded by a constant L (which is obviously an unusual assumption but not unrealistic on a short 
time scale see Section 2.2.2) or {St)te[o,T] is ruled by a Brownian diffusion model (see Section 2.2.1). 
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We consider on a short period T, say a dozen of seconds, a Poisson process modeling the execution 
of posted passive buy orders on the market 



{^t^^)Q<i<x with intensity At{6,S) 



r X{St - {So - 6))dt (2.1) 
Jo 



where < S < 5max with 6mnx the depth of the order book, Jmax £ (0, Sq), and (5'()j>o is a stochastic 
process modehng the dynamics of the "fair price" of a security stock (from an economic point of view). 
In practice one may consider that St represents the best opposite price at time t. 
One way to build N^^^ is to set 



(5) 



^7(f A(5t-(5o-<5))d< 



where A'" is a Poisson process with intensity 1 independent of the price {St)t^[f)^T]- 

This representation underlines the fact that for one given trajectory of the price St, the intensity 
of the Point process N is decreasing with 6: in fact the above representation for N^^^ is even pathwise 
consistent in the sense that if < S < 6' then 



-a.s. (VtG[0,r], Aff^<iV; 



r(S') 



The natural question how to account for the actually real possibility of simultaneously placing limit 
orders at different prices may be much harder to handle and is not studied in this paper. In partic- 
ular, due to interacting impact features, it would need a more sophisticated approach then simply 
considering (A^'^^'^^)i<fc<i<:, processes as above with 6{1) < 6{2) < . . . < S{K) (with the same N). 

We assume that the function A is defined on [— ^o, +00) as a finite non-increasing convex function. 
Its specification will rely on parametric or non parametric statistical estimation based on former 
obtained transactions (see Figure 1 below and Section 5). At time t = 0, buy orders are posted in the 
limit order book at price So — 5. Between t and t + At, the probability for such an order to be executed 
is X{St — {Sq — 5))At where St — (So — 6) is the distance to the current fair price of our posted order 
at time t. The further the order is at time t, the lower is the probability for this order to be executed 
since A is decreasing on [— S'0,+00). Empirical tests strongly confirm this kind of relationship with a 
convex function A (even close to an exponential shape, see Figure 1). Over the period [0, T], we aim 



Figure 1: Empirical probabilities of execution (blue stars) and its fit with an exponential law (red 
dotted line) with respect to the distance to the "fair price" . 

at executing a portfolio of size Qt € N invested in the asset S. The execution cost for a distance 5 is 
E (Sr\ — 6) [Qt A n!^^] . We add to this execution cost a penalization depending on the remaining 
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quantity to be executed, namely at the end of the period T, we want to have Qt assets in the portfoho, 
so we buy the remaining quantity (Qt — N^^\ at price 5^- 



At this stage, we introduce a market impact penalization function <I> : M i— )• M+, non-decreasing 
and convex, with $(0) = to model the additional cost of the execution of the remaining quantity 
(including the market impact). Then the resulting cost of execution on a period [0,T] reads 

C{5) :=e[(So-5)(QtA7V^'^^) +KST$((QT-iV'?^)+)] (2.2) 

where At > is a free tuning parameter. When ^{Q) = 1, we just consider that we buy the remaining 
quantity at the end price St- Introducing a market impact penalization function <l>(x) = (1 + ri{x))x, 
where ^/ > 0, ^ 0, models the market impact induced by the execution of (^Qt — ^t'^^+ time 
T whereas we neglect the market impact of the execution process via limit orders over [0,T). Our 
aim is then to minimize this cost by choosing the distance to post at, namely to solve the following 
optimization problem 

min C{6). (2.3) 

0<5<5max 

Our strategy to solve numerically (2.3) using a large enough dataset is to take advantage of the 
representation of C and its first two derivatives as expectations to devise a recursive stochastic algo- 
rithm, namely a stochastic gradient procedure, to find the minimum of the (penalized) cost function 
(see below). Furthermore we will show that under natural assumptions on the quantity Qt to be 
executed and on the parameter k, the function C is twice differentiable, strictly convex on [0, (5niax] 
with C"(0) < 0. Consequently, 

argmin5g[o_5^^^^]C(5) = {5*}, 5* G (0,(5max] 

and 

5* = (5max iff C is non-increasing on [0, Jmax]- 

Criteria involving k and based on both the risky asset S and the trading process especially the 
execution intensity A, are established further on in Proposition 4.1 and Proposition 4.2. We specify 
representations as expectations of the function C and its derivatives C and C" . In particular we will 
exhibit a Borel functional 

H : [0,W] xI])([0,T],M) 

such that 

V<5 G [0,5n.ax], C'{5) =¥.[h[5, (5t)te[o,T]) 

The functional H has an explicit form given in Proposition 3.2, Equations (3.14) or (3.16), involving 
integrals over [0, T] of the intensity \{St — Sq + 5) of the Poisson process 

)te[o,T]- In particular, 

any quantity H (5, {St)t^[o^T]) can be simulated, up to a natural time discretization, either from a true 
dataset (of past executed orders) or from the stepwise constant discretization scheme of a formerly 
calibrated diffusion process modeling (5t)tg[o,T] (see below). This will lead us to replace for practical 
implementations the continuous time process {St)t(^[Q,T] over [0, T], either by a discrete time sample, 
i.e. a finite dimensional M"^"'"^-valued random vector (5't.)o<i<m (where to = and t^ = T) or by a 
time discretization scheme with step ^ (typically the Euler scheme when {St)ti=[o,T] is a diffusion). 

A THEORETICAL STOCHASTIC LEARNING PROCEDURE: Based ou this representation (3.14) of C, we 
can formally devise a recursive stochastic gradient descent a.s. converging toward 5* . However to make 
it consistent, we need to introduce constrain so that it lives in [0,(5niax]- In the classical literature on 
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Stochastic Approximation Theory (see [18] and [19]) this amounts to consider a variant with projection 
on the "order book depth interval" [0,5max]i namely 

6n+l = Proj[o,5,^,J (<5„ - -fn+lH (^6n, ('5'J"^^^)te[o,T])) 1 n>Oi (0,(5max), (2.4) 

where 

• Projjo 5^^^^] denotes the projection on the (nonempty closed convex) [0,(5max]! 

• (7n)n>i is a positive step sequence satisfying (at least) the minimal decreasing step assumption 
Y^n>i 7n = +0O and 7^ 0. 

• the sequence |('S'j"^)tG[o,T]i ^ ^ o|) is the "innovation" sequence of the procedure : ideally it 
is either a sequence of simulable independent copies of {St)te[o^T] or a sequence sharing some 
ergodic (or averaging) properties with respect to the distribution of {St)t^[o^T]- 

The case of independent copies can be understood as a framework where the dynamics of S is typically a 
Brownian diffusion solution to an stochastic differential equation, which has been calibrated beforehand 
on a dataset in order to be simulated on a computer. The case of ergodic copies corresponds to a 
dataset which is directly plugged into the procedure i.e. S^"^^ = St-nAt, i ^ [0, T], n > 0, where At > 
is a fixed shift parameter. To make this second approach consistent, we need to make the assumption 
that at least within a laps of a few minutes, the dynamics of the asset S (starting in the past) is 
stationary and shares e.g. mixing properties. 

The resulting implementable procedure: In practice, the above procedure cannot be imple- 
mented since the full path (5't(a;))ig[o_T] of a continuous process cannot be simulated nor a functional 
H{5, (5f(a;))fg[o^T]) of such a path can be computed. So we are led in practice to replace the "copies" 
by copies of a time discretization of step, say At = ^, (m G N*). The time discretizations are 
formally defined in continuous time as follows 

- - iT 
St = Sti, t e [ti,ti+i), i = 0, . . . ,m - 1 with ti = — , i = 0, . . . ,m, 

m 

where (5tJo<i<m = iStJo<k<m if (5'tJo<i<m can be simulated (see e.g. [6] for ID-Brownian diffusions 
processes). The sequence (5t.)o<i<m can also be a time discretization scheme (at times ti) of {St)ti^[o^T)^ 
typically an Euler scheme with step ^. 

Then, with an obvious abuse of notation for the function H, we can write the implementable 
procedure as follows: 

,5„+i = Proj[o,5_](5„-7n+i^('^n,(sf+'\<,<J), n>0, <5oG[0,J^ax] (2.5) 

where {^1^^) q^-^^ are copies of (St^) q.^^^^^ either independent or sharing "ergodic" properties, namely 
some averaging properties in the sense of [21]. In the first case, one will think about simulated data 
after a calibration process and in the second case to a direct implementation of a historical high 
frequency data base of best opposite prices of the asset S (with e.g. 5^"^ = S^,_^t_). 

2.2 Main convergence results 

The following theorems give a.s. convergence results for the stochastic procedure (2.4): the first one 
for i.i.d. sequences and the second one for "averaging" sequences (see [21]). 
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2.2.1 I.i.d. simulated data from a formerly calibrated model 

In this section, we consider that the innovation process { (5^"^)^^^^^, n > O} comes from a diffusion 
model beforehand calibrated on real data which can be simulated at time ti, < i < m, either exactly 
or via a stepwise constant time discretization scheme. 

Theorem 2.1. (a) Theoretical procedure. Assume that C is strictly convex [0,(5max] with 
C(0) < 0. Let (5't"^)(g[o n > 1, be a sequence of i.i.d. copies of {St)te[o,T]- Furthermore, as- 
sume that the decreasing step sequence satisfies the standard "decreasing step assumption" 

^7n = +oo and ^7^<+oo. (2.6) 

n>l n>l 

Then the recursive procedure defined by (2.4) converges a. s. towards its target 6* = argmin^g[o,5max]^('^)- 

(b) Implementable procedure. Assume the cost function C related to the discretization scheme 
{St)te[Q,T] is strictly convex [0, 5max] with C"(0) < and the step sequence satisfies the "decreasing 
step" assumption. Let ^)q<^<^, n > 1, he a sequence of i.i.d. copies of {St^) q^-^^, then the 
recursive procedure defined by (2.5) converges a.s. towards its target 5* = argmin^gjg ^^^^](7(5). 

This theorem is a straightforward application of the classical a.s. convergence for constrained 
stochastic algorithms (see Appendix A). In particular, the fact that in the original theorem the inno- 
vation process takes values in a finite dimensional space plays no role in the proof. 



2.2.2 Direct implementation on a historical high frequency dataset sharing averaging 
properties 

In this framework we will focus on the time discretized procedure i.e. on {St)t^[Q^T] rather than on 
('S'i)ie[o,T] itself. Keep in mind that, when directly implementing a high frequency dataset, then 

St = Stit £ [ti,ti+i),i = 0, . . . , m and St = St- 

and that the sequence {S^^'')o<i<m, n > 1, is usually obtained by shifting the data as follows: if At > 
denotes a fixed time shift parameter such that ti — = At = we set 

vtG[o,r], sP = Su,nAt = Su.^. 

We will assume that the sequence (5'|"^)o<j<m shares an averaging property with respect to a 
distribution v as developed in [21]. The definition is recalled below. 

Definition 2.1. Let men andv be a probability measure on ([0, L]™+\ ^or([0, Lj'^+i)). A [0,L]™+1- 
valued sequence (^n)n>i is v-averaging if 



1 " 
n ^-^ 

k=l 



^ (R™+i) 

^^j. =^ u as n ^ oo. 



7 



Then (^„)n>i satisfies 

DniO ■= sup -Y'lio.xlfe) - i^([0,x]) — ^0 asn^oo, 

where is cahed the discrepancy at the origin or star discrepancy. 

The resulting execution cost function C is defined by (2.2) where S is replaced by (<St)fg[o,T] whose 
distribution is entirely characterized by the distribution v. In some sense this function C is the best 
possible approximation of the true execution function C that we can get from the high frequency 
database. 

In this setting, we apply the previous results to the price sequence { (-S'if ^)Q<j<^, 't- > O}, i.e. we 
set for every n > 1, ^„ = {Stf') q^^^^^- In particular we will make the assumption that the dataset is 
bounded by a real number L G (0, +oo) so that ^„ G [0, L]"^"^^ for every n > 1 . Moreover, we will 
need to prove the existence of a pathwise Lyapunov function, which means in this one dimensional 
setting that H{-, ((stJo<i<m) is non-decreasing for every (st.)o<i<m G M™+^ n > 1. 

Theorem 2.2. Implementable procedure. Let \{x) = Ae~'^^ , A > 0, k > 0. Assume (5'^"^)„>i 
is an [0, L]"^'^^ -valued u-averaging sequence where u is a probability measure on {W^^^ , Bor{MJ^^^)) . 
Assume that the execution cost function C is strictly convex over [0, (5max] with C"(0) < and 
C'{5 

max) > 0. Finally assume that the step sequence (7n)n>i is positive non-increasing sequence 
satisfying 



E7„ = +00, nDl{S)-fn — ^ 0, and ni:>*(S') max (7^, |A7„+i|) < +oo. (2.7) 

n>l n>l 

Furthermore (having in mind that Sq = Sq), assume that 

Qt>2TX{-So) and (2.8) 

Then the recursive procedure defined by (2.5) converges a.s. towards its target 6* = argmin^g[Q^5^^^]C(5); 

Proof. We apply Theorem 2.1 Section 2 of [21] in a QMC framework similar to Appendix A. First we 
have that H{6*, •) G L^(P). Note that 6* G (0, (5max) since C"(0) < and C"(Jmax) > so we can extend 
C' as a convex function on the whole real line. Moreover, by using the proof of Proposition 4.1(6), we 
prove that if Qt > 2TX(—So) and (2.8) is satisfied, then H is non-decreasing in 6 so that H satisfies 
the strict pathwise Lyapunov assumption with L{5) = — 5*\'^ , namely 

V5gM\{5*}, VyGM"'+\ {H{6,y) - H{6*,y) \ 6 -6*) > 0. 

It remains to check the averaging rate assumption for H{6* , ■): as (si)i<i<m ^ H{6* , (si)i<j<m) is a 
non-decreasing function (for (sj)i<j<m < {s'i)i<i<mi i-e. < i < rn, Si < s-, i.e. lO,s} C [0, s'J), then 
H{5*, •) has finite variation and by using the Koksma-Hlawka Inequality, we get 

|-Vi/(r,5W)- / HiS\sHds) <{Hi6*,L)-H{5*,0))D:iS), 
so that H{5* , •) is i/-averaging at rate e„ = D^(S). Finally, Theorem 2.1 of Section 2 from [21] yields 
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Practical comments of the needed bounds. 



The constraint Qt > 2TX(—So) is structural: it only involves parameters of the model and the 
asked quantity Qt- It means that Qt does have some chances not to be fully executed before 
the end of a slice of duration T (i.e. the intensity of trades obtained very far away from the 
current price is smaller than Qt/2). 

The criterion involving the free parameter k is two-folded depending on the modeling of the 
"market impact". 

— The market impact does not depend on the remaining quantity to be traded {i.e. when 
$ = id or = which implies that ^{Qt) — ^{Qt ~ 1) = !)• This setting is appropriate 
for executing very small quantities or trading very liquid assets (like equity futures). Then 
the criterion on the free parameter k reads 

Sq — 5max , 1 

K < — n^^n 1- 



M k\\S\\ 

II 1 1 oo II 1 1 oo 

It states that in this case, the constant premium to pay for the remaining quantity [i.e. 
K, in basis points) has to be lower than the price range inside which we wish to trade 
(i.e. {Sq — Sraax)/ ll^'Hoo) plus a margin, namely l/{k II^H^^). It can be seen as a symmetry 
argument: a model where one cannot imagine buying at a lower price than a given threshold, 
one cannot either accept to pay (on the other side) more market impact than this very 
threshold. 

— The market impact of the remaining quantity is a function of the quantity. The interpreta- 
tion is very similar to the previous one. In this case a quantity homogeneous to the market 
impact {i.e. k- {^{Qt) — '^{Qt — ^)) in basis points) should not exceed (So — (5max)/ + 
1/(A: ||'5||j^)- Here again it is a symmetry argument: the trader cannot consider to pay more 
market impact for almost one share {i.e. ^{Qt) — ^{Qt — 1) plays the role of <^(1) but 
"taken around Qt") than his reasonable trading range {{Sq — (5max)/ H'S'IIq^^ in basis points 
again), plus a margin. Looking more carefully at this margin: l/(/c HSH^) (the same as 
in the constant market impact case), it can be read that it is in basis points, and means 
that if one considers large intensities of fill rates {i.e. k is small) then the trader can be 
lazy on the market impact constraint (because his margin, proportional to 1/k, is large in 
such a case), mainly because he will have to pay market impact not that often. If on the 
contrary, k is large {i.e. he will have remaining quantities) then he needs to really fulfill his 
constraint on market impact. 

The needed bounds to obtain the convergence of (5„ toward S* are thus not only fulfilled naturally. 
They also emphasis the consistency of the model and the mechanisms of the proofs. 



2.3 Criteria for the convexity and monotony at the origin 

In this section, we look for simple criteria involving the parameter k, that imply the requested assump- 
tion on the execution cost function C (or C). One important feature of this section is that we will 
never need to really specify the process S. For notational convenience we will drop the C notation. 

Checking that the assumptions on the function C {i.e. C convex with C"(0) < 0) in Theorem 2.1 
are satisfied on [0, Jmax] is a nontrivial task: in fact, as emphasized further on in Figures 2 and 7 
in Section 5, the function C in (2.2) is never convex on the whole non-negative real line, so we need 
reasonably simple criteria involving the market impact function Qt and the parameter k and others 
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quantities related to the asset dynamics which ensure that the required conditions are fulfilled by the 
function C. These criteria should take the form of upper bounds on the free parameter n. 

Their original form, typically those derived by simply writing C"(0) < and C"(0) > 0, are not 
really operating since they involve ratios of expectations of functionals combining both the dynamics 
of the asset S and the execution parameters in a highly nonlinear way. A large part of this paper is 
devoted to establish simpler criteria (although slightly more conservative) when {St)[Q^T] is a continuous 
process satisfying a functional co-monotony principle. 

We still need an additional assumption, this time on the function A. Roughly speaking we need 
that the functional A depends on the distance parameter 5 essentially exponentially in the following 



sense (which depends on S): 
< ^1 := P- essinf 



< A; 



£!i2 



<5e[0,5„ 



■ essinf 

5G[0,(5max 



^At(^,5) 
At(<5,5) 



^At(<5,5) 



^At(<5,5) 



< ki :-- 



< ko 



■ esssup 

5e[0,(5niax] 



■ esssup 

5e[0,(5niax] 



At(5,5) 



■At(<5,5) 



< +00, (2.9) 



< +00. (2.10) 



Note that the above assumption implies 



kn 



-esssup 



as 



At(0,5) 



At(0,5) 



-essinf 



At(0,5) 



> ^1 > 0. 



(2.11) 



Although this assumption is stated on the functional A (and subsequently depends on S), this is 
mainly an assumption on the intensity function A. In particular, the above assumptions are satisfied 
by intensity functions A of the form 



-kx 



X € 



Afc(x) -- 

For these functions A^, one checks that = = hi 



k € (0,+oo), 
ki = k2 = k2 



k. 



The key to establish the criteria is the functional co-monotony principle that we establish in the 
Appendix B for a wide class of diffusions and their associated time discretization schemes. 
A Borel functional F : D([0,T],M) — ?> M is non-decreasing if 



Va,/3 G B([0,r],M), (Vt G [0,T], a{t) < /3{t)) 



It is monotonic if F or —F is non-decreasing. Two functionals F and G on 
if they are monotonic with the same monotony. 
A functional F has polynomial growth if 

3r>0 s.t. Va G lD)([0,r],E), \F{a)\<K{l + 



F{a) < F{f3). 

([0, T], M) are co-monotonic 



a 



Definition 2.2. A stepwise constant cddldg (resp. continuous) process {St)te[o,T] satisfies afunctional 
co-monotony principle if for every pair F ,G : ID)([0,T],M) — > M of co-monotonic Borel functionals (resp. 
continuous at every a G C([0, r],M) for the sup-norm) with polynomial growth such that F{S), G{S) 
and F{S)G{S) G L'^ , one has 



E 



te[o,T] 



> E 



(2.12) 



We will use in the proofs below this principle for the price process {St)t£[o,T] ^or monotonic func- 
tionals with opposite monotony. Thus Inequality in (2.12) is reversed (all we have to do is to replace 
F by —F). This co- monotony principle is established and proved in Appendix B for a wide class of 
Brownian diffusions, their discrete time samples and their Euler schemes. 



The main dynamics in which we are interested are the following 
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1. An "admissible" Brownian diffusion in the sense od Definition B.2 satisfies a functional co- 
monotony principle. So does its Euler scheme with step at least for m large enough, say 
m > rrib^a, where m^^o- only depends on (the Lipschitz coefficients of) the drift b and the diffusion 
coefficient a of the SDE that {St)t£[o^T] is solution to. 

2. Finally any discrete time sample (S'tJo<i<m of a continuous process satisfying a functional 
co-monotony principle also satisfies a co- monotony principle for continuous functions f,g : 
l^m+i ]^ monotonic in each of their variables with the same monotony such that /((5't.)o<i<m); 

9{iSu)o<i<m), fg{{Su)o<i<m) G L^- 

3. If (5'j.)o<i<m is a Markov chain whose transitions 

Pig{x) = E [^(St^+i \St,=x], z = 0, . . . , m, 

preserve monotony (g non-decreasing implies Pig is non-decreasing), then {StJo<i<m also satisfies 
a co-monotony principle in the same sense as for discrete time sample. 

The proof of 1. is postponed in Appendix B (Theorem B.2). Admissible diffusions include standard 
Brownian motion, geometrical Brownian motion and most models of dynamics traded assets (see 
Section B.2. 3). 

Claim 2. follows by associating to a function / the functional F{a) = /((Q;(tj))o<i<m)- 
For claim 3., we refer to [23] Proposition 2.1. 

Theorem 2.3. Assume that iSt)t)=[o^T] satisfies a functional co-monotony principle. Assume that 
the function A is essentially exponential in the sense of (2.9) and (2.10) above. Then the following 
monotony and convexity criteria hold true. 

(a) Monotony at the origin: The derivative C(0) < as soon as 
Qt>2TX{-So) and k< ^ + h^o 



ko¥. [5t] ($(Qt) - ^(Qt - 1)) 
(6) Convexity. Let pq G ^0, 1 - p|j^)i^Q^~^j ^ 5 )) ' ^ ^ "•^•^^"^'^ ^^^^ ^ satisfies 

VxG [1,Qt-1], ^{x)-^{x-l) < pq{^{x + 1) -^{x)). 

2 

/ k \ 2k 

If Qt > 1 2v(l+-^))rA(-5o) and k<^ 



klk2') - kik2^[ST]%{QTy 

then C"{5) > 0, (5 G [0, 

'^max]; -50 that C is COnVCX on [0, 5jnax]- 

(c) The same inequalities hold for the Euler scheme of {St)t(^[Q^T] ^^^^ ^^^P for m > nib^a, or for 
any W^^^-time discretization sequence (5'tJo<i<m which satisfies a co-monotony principle. 

Remark. These conditions on the model parameters are conservative. Indeed, "sharper" criteria can 
be given whose bounds involve ratios of expectation which can be evaluated only by Monte Carlo 
simulations: 

C7'(0) < ^ < K < 62, 
^ E [-gTP(°) (A^'^ > Qt) + (5o^At(0, S) - At(0, S)) {N'' < Qt - 1)] 

where Oo — ;: o :; 

E[5t^At(0,5)v.(0)(/x)] 
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and 



C is convex on [0, dr, 



< K < min 



Ai5) 



where 



sev+ B{S) 

A{d) = E (^{So-5)^AT{S,S)-2^AT{d,S)j¥('HN'^<QT-l) 



{So-d)(-^AT{6,S) 



B{6) = E 



St (^At{S,S)<p('Hh) - (J-^At{6,S)^\^'\h) 



(ATM = Qt-1) 



and V+ = {6e [0, 6^^] \ B{6) > 0} 



3 Representations as expectations of C and its derivatives 

First we briefly recall for convenience few basic facts on Poisson distributed variables that will be 
needed to compute the cost function C and its derivatives C and C" (proofs are left to the reader). 

Proposition 3.1. (Classical formulas). Let (-/V'*)^>o be a family of Poisson distributed random vari- 
ables with parameter /x > 0. 



(i) For every function f : N 
d 



1+ such that log/(n) = 0{n), 



dfx 



E [/(iV^)] = E [/(iV^ + 1) - /(iV^)] = E 



f{N^ 



ATM 



In paHicular, for any keN, {N^' < k) = -¥ {N^' = k). 
For any k eW, 

(ii) E[kA N^'] = kF {m >k) + nF{m <k- 1) and ^E [A; A iV^] = P (AT^* < A; - 1), 

(Hi) E [{k - N^')^] = kF {N^' < k) - ^F {Nt" < k - 1), 
(iv) kF {Ni" = k) = |J,F{N^' = k-l). 

To compute the cost function (or its gradient), it is convenient to proceed a pre-conditioning with 
respect to := a{St,0 <t <T). We come down to compute the above quantity when A^^''^ is 
replaced by a standard Poisson random variable with parameter /it denoted N^. Therefore we have 



{So - S) (Qt a AT^f)) + kSt^ ((Qt - N^^^^ 



(5o-<5)E (QtAK 



J- X 



+ kStE 



+ 



Qt - N^^ 



J- T 



C{S) = E 

= E 
= E 

= K[C[S,AT{S,S),{St)o<t<T 
where for every x G C([0, T],M+) and every ^ € M+, 

C {5, II, x) = (xo - 5) {QtF (AT^ > Qt) + /iP (iV^ < Qr - 1)) + kxtE [$ {{Qt - N^')^)] . 
We introduce some notations for reading convenience: we set 

P(^) (iV^ >Qt) = F {N^^ > Qt) |^=a,(5,5) and e(^) [f{^^)] = E [/(/.)] |^=a,(,,5) • 
Now we are in position to compute the first and second derivatives of the cost function C. 



(3.13) 
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Proposition 3.2. (a) // $ / id, then C'{6) = E [H{6,S)] with 

H{5, S) = -QtP^^^ (iV^ > Qt) + (^At(<5, S){So -5)- At{5, S)^ p(^) (A^'^ < Qt - 1) 

-k5t^At(<5,5)(^(^)(m) (3.14) 
w;/iere ip'^^^fi) = E('^) [($ (Qt - N^') - <^ (Qt - N^' - 1)) l{7VM<Q;r-i}] 



^"(5) = E 



f (5o - 5)^At(<5, - 2^At(5, S)] F^'^ (TV^ < Qt - 1) - k5t^At(5, (/.) 
-(^0 - 5) (^At(5, 5)) ' P(^) (iV'' = Qt - 1) + ^5t (^At(<5, 5)) ' (/z) 

w/iere V^'^Ha*) = ^^^^ ((<5t - A^^ - 2)^) - 2$ ((Qt - iV^ - 1)^) + $ ((Qt - ■ 
(6) // $ = id, i/ien C'{6) = E [H{6, S)] with 

H{6, S) = -QtF^^^ {N>' > Qt) + (^{So - S - kSt)-^At{S, S) - At(<5, 5)^ p(^) {N" < Qt - 1) (3.16) 



and C"{6) = E 



{So-6- k5t)^At(5, S) - 2^At(<5, S)^ p(^) (iV^ < Qt - 1) 
{So-6- kSt) (^At(<5, 5)^ P(^) (A^^ = Qt - 1) 



(3.17) 



Proof. Interchanging derivation and expectation in the representation (3.13) imphes 

C'{6) = IE (<5, At{5, S), (5t)o<i<T) + (5, At(5, 5), (5<)o<,<^) ^At(5, 5) 

(a) We come down to compute the partial derivatives of C {6,i2,x). 

— (5, /i, x) = -E [(Qt a iV^)] = -QtP {N^^ > Qt) - /uP {N^ < Qt - 1) , 
— {6, ^, x) = -ixo- 6)—E [{Qt - A^^)+] + kxt^-K [$ ((Qt - Ar^)+)] . 



QtP (A^^ = Qt) - P (A^'' 

P (A^^ < Qt - 1) thanks to (iv) in Proposition 3.1 



d 

We have — E [(Qt - Ar'^)+] = -QtP (A^^ = Qt) - P (A^'' < Qt - 1) + ^P (A^^ = Qt - 1) 



and |-E [c^ ((Qt - A^^),,)] = E [$ ((Qt - A^^ - 1)+) - <I> ((Qt - Ar^)+)] 

= E [(<D (Qt -m-l)-<^ {Qt - m)) l{m<Q^-i}] ■= -^{p) 
owing to [v) in Proposition 3.1. Therefore 

dC 

— (5, ^, x) = (xo - 5) P {N^^ < Qt - 1) - KXT(^(/i). 
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Consequently 



C'{6) 



E 



-Q^p(5) (iv/^ > Q^) + (^-^At{5, S){So -6)- At{5, S)^ F^^'^ < Qt - I) 



where ip^^\fi) := <^(/^) |/x=At(<5,5) for every x G C([0,T],M+) and every /i, € M+, 

d {6, /X, u, x) = -QtF {N'' > Qt) + {v{xo - 5) - /x) P (iV^ < Qt - 1) - kxti^V3(/u). 
Interchanging derivation and expectation in the representation (3.18) impHes 

C"{6) 



(3.18) 



E 



U At{S, S), ^At(5, S), (5i)o<,<^ 



(5, Arid, S), ^AriS, S), (5t)o<,<2^) ^At{6, S) 
+ ("5, At{S, S), ^At(<5, 5), (50o<i<T) ^MS, S) 



We deal now with the partial derivatives of C {6, /i, z^, x). 

(iV^ < Qt - 1) , 

((5,/x,z^,x) = -¥{N^' <Qt-1)-{xo-6)i^¥{N^' = QT-l) + KXTlyi^i^l), 
{5, n, u, X 

Consequently 

C"{6) = E 



— {d,fx,v, x) 

dC_ 

djj, 

dv 



{xQ - (5)P {N^ <Qt-1)- i^XT^ifi). 



{So - <^)^At(<5, S) - 2^At(5,5)) P^-^) (7V^ < _ 1) _ ^5t^At(5, 5)^^') (/^) 
-(5o-5)(^At(5,5)) p(')(iV'' = QT-l) + ^5Tf^AT(5,5)) V^')(;u) 



(6) If ^> = id so that ((Qt - iV'')+)] = -IP (^'^ < Qt - 1). Therefore 

dC BC 

— {6, x) = -QtP (A^^ > Qt)-/uP (A^^ < Qt - 1) and — (5, ^, x) = {xo-6-kxt)F {N^' < Qt - 1) 



Consequently 

C'{5) = E 

= E 



-QyP(5) (ATM > Q^) + ( _ 5 _ ^5^)^At(<5, S) - At{5, S) ] P^'') < Qt - 1) 



C( 5,At(5, 5),^AT(<5,5),(5t 



(3.19) 
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where for every x € C([0, T], and every ^, € 

C (6, ^, 1^, x) = -QtF {N^' > Qt) + ((xq -5- kxt)^ - ^) F (iV^ < Qt - 1) • 
Interchanging derivation and expectation in the the representation (3.19) imphes 

C"{6) = E ^a(5,AT(5,S),^AT(5,5),(5iWT) 

[5, Arid, S), ^At{S, S), (5i)o<,<y) ^At{6, S) 

+ (5, At{6, S), ^At(5, 5), (50o<,<t) ^At(5, S) . 

We come down to compute the partial derivatives of C (^5, At{S, S), -^At{S, S), (>S't)Q<j<j. 
dC dC 

— (5, V, x) = {N^' < Qt - 1) , ^ (5, /x, i^, x) = (xq - 5 - kxt)P (A^^ < Qt - 1) , 

— {6, fi, V, x) = -P (AT^ < Qt - 1) - (xo - <5 - kxt)^At(<5, 5)P (A^^ = Qt - 1) ■ 
Consequently 

C"{5) = E (^{Sq-5-kSt)^At{5,S)-2^^At{8,S)^¥'^^'>{N^ <Qt-1) 

- (5o-5-k5t) (^Ar(5,5)^ P(^) (A^^ = Qt - 1) . □ 

4 Convexity and monotony criteria for the cost function C 

To ensure that the optimization problem is well-posed, namely that the cost function C has a minimum 
on [0, (^max]) we need some additional assumptions: the cost function C must satisfy C"(0) < 0. This 
leads to define bounds for the parameter k and this section is devoted to give sufficient condition on 
K to ensure that this two properties are satisfied. The computations of the bounds given below rely 
on the CO- monotony principle introduced in the Appendix Section B. 

4.1 Criteria for local and global monotony 

The proposition below gives bounds for the parameter k which ensure that the execution cost function 
C has a minimum. The aim of this subsection is to obtain sufficient bounds, easy to compute, namely 
depending only of the model parameters. 



Proposition 4.1. Assume that (5'j)^g[o,T] satisfies a co-monotony principle 

At(0,5) 



—A (0 S) 

a) Monotony at the origin. C"(0) <0 as soon as Qt > 2TX{-So), ko = infess ( -^7-777^ ) > ^ 



and 

1 + ^05*0 



K < 



koE [ST]mQT)-HQT-l)) 
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In particular, when $ = id, the condition reduces to 

1 + kQSo 



koE [St] ' 

(6) Global monotony (exponential intensity). Assume that s* := \\ sup St\\L°° < +00. // 

te[o,T] 

A(x) = Ae"^^', A> 0, k > 0, Qt > 2TX{-So) and 

. 1 + fc(5o - (^max) -f ^ ^ 1 + ^(5*0 - (5max) , . , . . „„x 

k7* "^ fe.*(c|>(Q^)-c^(Q^-l)) '^^ = '''^ (^-2°) 

i/ien H{-,{yi)o<i<m) is non- decreasing on [0, (5max] /o?' e?;ery (yi)o<i<m e [0,5*]""+^ 
(c) r/ie same inequality holds for the Euler scheme of {St)t£[o,T] ^^^^ ^^^P for rn > mi,^^, or 
for any W^^^-time discretization sequence {StJo<i<m which satisfies a co-monotony principle (see 
Theorem 2.3-{c) and Corollary B.l). 

To prove this result, we need to establish the monotony of several functions of // which appear in 
the expression of C . 



Lemma 4.1. (i) The function fi 1 — > fiF (N^ < Q) is non- decreasing on 



0, 



Q+l 
2 



(ii) The function /x ^ e(Q, fj.) := E - A^^*) - $(Q - N^' - 1) \ N^' < Q - 1] satisfies Q{Qt, < 
e(QT,0) = $(Q) - - 1) for all fi. 

Proof of Lemma 4.1. (i) We have — (^P (iV^ < Q)) = P (iV < Q) - /uP (iV = Q). Consequently 

^ Q 
|;(,.P(«"<«)>0 iff E^SMf. 

But k I — )• — is non-decreasing on {0, 1, . . . , I //J } and non-increasing on {l/ij, . . .}. 
k\ 

Hence XI TT - ^ TiT " ~ L^J) TTT' ^° X] TT - ^7^ ^ ^^^"^ as Q > 2[^J + 1. 
A— n ^' ^' /r-n ^' 

(zi) The function is non-decreasing, non-negative and convex with $(0) = 0. If we look at the 
representation of /u 1— )• A^^ by ^ 

N^{ijj) = max 

i=l 

where Ui are i.i.d. uniformly distributed random variables on the probability space (il,^, P), then 
(J, ^ is clearly non-decreasing, so fi Q — is non-increasing and fi 1— ?• (p{ji) = ^{Q — N^) — 
$(Q - A^^ - 1) too (because of the convexity of □ 

Remark. If /i G (0,1), then fj, 1 — > fi¥ (N^ < Q) is always non-decreasing. If G [1,2), then the 
function ji 1 — > jjF (iV^ = 0) = //e~^ is not always non-decreasing clearly, but only on [0, 1]. 

Proof of Proposition 4.1. (a) In our problem the intensity parameter fJ^ = ^{St — Sq -\- 6)dt is 

Jo 

continuous, non-increasing to zero when 6 tends to -l-oo and bounded by assumption (A(— 5o) < +00). 
Hence n G [0, A(-S'o)r]. 
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(i) Prom (3.18), we have for 6 = 0, 



0<t<T 



< [so^At{0, S) - At(0, 5)) P(0) (iV^ < Qt - 1) - kSt-^At{0, {^^) 
^At(0, 5) (5o - kStQ{Qt, At(0, 5))) - At(0, S)^ p(°) (A^^ < - 1) , 



because — QtP (A^^ > Qt) |^=Aj,(o s) ^ ^ small if Qt is large. Let feg ^-iicl ko be defined by (2.11) 



We have that > a.s. and ko > a.s. by assumption, i.e. -^At{0,S) < —kQAr^O, S) a.s. and 



^At{0,S) > -koAT{0,S) a.s.. Then 

)0<t<T 



< - (1 + kj^So - KSrkoQiQT, At(0, 5))) (/xP (AT^^ < Qt - 1)) 



:At(0,S) ■ 



Now, by Lemma 4.1, 



e(QT,/x) < G(Qt,0) = $(Q)-$(Q-1) = ^(0). 



Therefore a(o, At(0,5), ^At(0,5), (St)o<t<T^ 

< - (1 + ^0*50 - f^Srko mQr) - f (Qt - 1))) (^^ (A^'^ < Qt - 1)) |^=a^(o,s) ■ 

By Lemma 4.1, if Qt > 2rA(— ^o), then fi i — > /iP (A^^ < Q — 1) is non-decreasing. Moreover, the 
functionals 

F : ]D)([0,r,M) R 



a H> AT(0,a)= [ X{a{t) - So)dt 
Jo 



is non-increasing and 

a(T) I — y (-1 - kffSo + Ka{T)ko ($(Qt) - ^'(Qt - 1))) is non-decreasing. 

Both are continuous (with respect to the sup-norm) at any a G C([0,T],M). Conserquently, we obtain 
by the functional co-monotony principle 

E [(-1 - kj^So + KSrko mQT) - HQt - 1))) (a^P {N>' < Qt - 1)) i^,=Ar{o,s) 
< E [-1 - k^So + KSrko ($(Qt) - ^(Qt - 1))] E [(^P {N^^ < Qt - 1)) |^=a^(o,s) 
In turn, this implies 

C'{0) < E [-1 - kj^So + KSrko (f (Qt) - HQt - 1))] E [(^P (AT^ < Qt - 1)) [^=aho,5) 



As E 



I.e. 



(/iP (A^^ < Qt - 1)) |^=At(o,S) ^ 0' then C"(0) < as soon as 

E [-1 - koSo + KSrko ($(Qt) - $(Qt - 1))] < 0, 

1 + ^0-5*0 



K< ^ 



fcoE [5t] («'(Qt) - «'(Qt - 1)) 
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(6) From (3.18), the form of A and (a), we get that for every 6 £ [0,5max]5 5' = (5'j)i<i<m G M'"^-'^, 

Hi6, S) = -QtP(') (iV^ > Qt) + f{6, S) (^P (iV^ < Qr - 1)) |^=ah5,5) ' 
where S) = -l- k{So -6- kStQ{Qt - 1, At(<5, 5))) if $ id and f{6, S) = -l- k{So -5- kSt) 
if ^> = id. Since 6 ^ -QtP^^^ (iV > Qt) is non-decreasing, 6 ^ (//P(A^'^ < Qt - 1)) 



\^^=AT{s,s) 



IS 



non-increasing and non positive owing to Lemma 4.1 (i) and 5 i— )• 5) is non-decreasing owing to 
Lemma 4.1 (ii), 6 i-)- H{5,S) is non-decreasing if f{6,S) > 0, 5 G [0,5max]5 G R*""^^, which leads to 
(4.20). □ 

4.2 Sufficient condition for the convexity condition 

We give below sufficient condition to insure the convexity of the execution cost function C, namely 
conservative upper bound for the free parameter k. 



Proposition 4.2. Assume that {St)te[o,T] satisfies a co-monotony principle. 

ff^<Qj.-l) |^=TA(-5o) 

VxG [1,Qt-1], $(x)-«>(x-1) < pgi^x + l) -<^{x)). 



(z) If ^ ^ id, assume that there exists pg G (0, 1 — p[jVA'<o^-ii ^ ^ ) -^^c/i that 



-2 



// Qt > 2 V 1 + -^) rA(-5o) and k < ^ 

where is the left derivative of the convex function ^, then C"{6) > 0, (5 G [0, Jmax]; so thdt C is 
convex on [0, 

[ii) When <I> = id, the hound on k reads 

2k, 



K < 



kik2E[ST] 

Remark. Note that if we replace the diffusion process («S'()jg[o,T] by its stepwise constant Euler 
scheme or by a discrete time sample ('S'j.)o<i<m) we have the same criteria owing to Corollary B.l (see 
also Theorem 2.3- (c)). 

To prove the above Proposition, we need the following results 
Lemma 4.2. If p < Q — 1, then p i-^ p(jv>'<Q-i) ^"^ non- decreasing. 
Proof of Lemma 4.2. 

dF{Ni' = Q-l) _ F{N^' = Q -2) -FiNt" = Q -1) ^F{Nf = Q -if 



dp P (iVA* < Q - 1) P(iVM<Q-l) P(iVM<Q-l) 



2 



P (ATM = g _ 1) (ATM < Q _ 1) _ Ij + p (ATM = Q _ 1) 

F{N^'<Q-lf 

>Oifp<Q-l. n 
Lemma 4.3. Assume that <I> ^ id. If there exists pn G ( 0, 1 — m^t1^9>^ 1^ I such that 

VxG [1,Qt-1], $(x)-^>(x-l) <pq(«>(x + 1)-$(x)), 
then p I—)- p(jVM<Qj-i) ^'^ non-increasing where ip{p) = ^{Q — N^) — ^{Q — — 1). 



18 



Remark. If $ = id, then /i H- p(jVM'<Qj^i) = 1' therefore we do not need the previous lemmas. 
Proof of Lemma 4.3. We have 

d/iP(iVA'<QT-l) P (iVA* < - 1)2 P(iVM<Q2.-l) - 

where V(^) = <^ {{Qt - iV^ - 2)^) - 2^> ((Qt - iV^ - 1)+) + <^ {{Qt - N^)+) 

iff (A^^ < Qt - 1)+P {N^ = Qt-1) ^{fi) < 

iff P (iV^ < Qt - 1) E ((Qt - iV^ - 1)+) - ((Qt - iV^ - 2)^)] < P (TV^ < Qt - 2) (^(//). 
But 

$ ((Qt - iV^ - 1)+) - <!> ((Qt - iV^ - 2)+) 

< pQ (<!> ((Qt - iV^)+) - $ ((Qt - iV^ - 1)+)) l{iVM<Q^-2} 

= PQ (<!> ((Qt - iV^)+) - <!> ((Qt - - 1)+) - (<J>(1) - f (0)) 1|;vm=q^-i}) 

< pQ((i)((QT-iV^)+) -'J>((QT-iV^-l)+)) 

since ($(1) - «>(0)) 1|7vm=q^-i} > a.s. 

Consequently 

P (iV' < Qt - 1) E [$ ((Qt - - 1)+) - $ ((Qt - iV^ - 2)+)] - P (TV^ < Qt - 2) (^(;u) 
< (/5QP(iV^<QT-l)-P(iV'^<QT-2))¥^(M) 

Proof of Proposition 4.2. By using the notation (2.9)-(2.10), we obtain the following bound for 
the second derivative of the cost function 

C"{5) > E hk^AriS, 5)P('^) (A^'^ < Qt - 1) 

■7:2 

+(5o - 5)k,k^AT{5, S) (p(^) (7V^ < Qt - 1) - -rhMS, 5)p(^) (iV^ = Qt - 1) 

"^1 2 



-^c5tAt('5, 5) (A;ifc2</'^'^^(M) - Ai?At(5, 5)^(^)(m) 

By adapting the result of Lemma 4.1, we obtain, if Qt > ^1 + -^j-^TX{—So), that 

7:2 

(p(iV'' < Qt - 1) - jrirf^^i^^ = Qt - 1)) , 



E 



=At(5,S) 



> 



and by convexity of the penalty function <I>, we have i^^^Hn) > a.s. Then we obtain the following 
upper bound for k, 



K < 



2kiK 



(^P(iVM<QT-l))|^=A,(5,s) 



kik2E [5tAt(<5, S)ip(^)ifi)] 



(4.21) 
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By Lemma 4.3, i— )■ ^(jv^'^^QT-i) non-increasing and by Lemma 4.1 fx fjF {Nf^ < Qx — 1) is 
non-decreasing for Qj- > 2 [/xj — 1. Furthermore a i— > AT{S,a) is non- increasing. By applying the 
functional co-monotony principle, we then have, for Qt > 2rA(— 5o), that 



Therefore (4.21) will be satisfied as soon as 

K < 



{f^^{N^<QT-l))\,=A,^S,S) 



E 



¥{N^' <Qt-1)^ Im=At(5,s) 



2k, 



St 



y(M) 



Since by Lemma 4.3, fi H- p(Nf<Qj,-i) is non-increasing, we get 



< 



' {Ni" <Qt-1)^ Im=At(<5,5) ~ VP (iVM < - 1) ; 1^=0 

which finally yields the announced (more stringent) criterion 

< 2^1 
^ - kik2E[ST] $'(Q)' 



^Q+)-^{Q-l)+)<^',{Q), 



□ 



Remark. As 5 G [0, 5max], then (Sq - S) G [Sq - 5max, ^o] and 

(So - 5)k,k^ATi5, S) (P(^) {N^' < _ i) _ j^^At{6, S)f'^'^ (iV^ = Qt - 1) ) 



t:2 

> (5o - 6m..)hk2ATi6, S) (p(^) (iV^ < Qt - 1) - ]^^^Ti6, 5)p(^) (A^'^ = Qt - 1) 



t:2 

{So - 6m.^)k,k^ (iV^ < Qt - 1) - 777r/"IP (^"^ = Qt - 1) 



|/i=AT{5,S) 



Unfortunately we cannot use the functional co-monotony principle to improve the bound because, 
for Q > ^2 V ^1 + ^^^^rA(— ^o), the function /u i-7> /iP (iV^ < Qt — 1) is non-decreasing and fi 

1 — ^^^li^^Q^Ii-^ is non-increasing, and we need to obtain a lower bound for this expression but 
co-monotony naturally yields an upper bound. 

5 Numerical experiments 

In this section, we present numerical results with simulated and real data. We first present the chosen 
model for the price dynamic and the penalization function. Within the numerical examples, we are 
modeling the optimal behavior of a "learning trader" reassessing the price of his passive order every 
5 units of time (can be seconds or minutes) in the order books to adapt to the characteristics of the 
market (fair price moves St and order fiow dynamics Nt). During each period of 5 seconds, the trader 
posts her order of size Q5 in the book at a distance 5 of the best opposite price {5 lower than the best 
ask for a buy order), and waits 5 seconds. If the order is not completely filled after these 5 seconds (say 
at time T), the trader cancels the remaining quantity (Q5 — A's)^ and buys it using a market order 



20 



at price St plus a market impact; she will buy at kSt (1 + fl{{Qb ~ -^5)+))- Then she can reproduce 
the experiment choosing another value for the distance to the best opposite 5. 

The reassessment procedure used here is the one of formula (2.4) using the expectation represen- 
tation of C given by Property 3.2 to provide the proper form for the function H. 

Then we plot the cost function and its derivative for a trivial penalization function $ = /d (r/ = 0) 
and for a nontrivial one. We conclude by giving the results of the recursive procedure in both cases, 
either on simulated data or on a real data obtained by "replaying" the market. 



5.1 Simulated data 



We assume that dSt = adWt, Sq = sq and At{S,S) = a[ e-^^^'-^°+^Ut 

Jo 

where {Wt)t>o is a standard Brownian motion and a,A,k > (this means that A(x) = Ae~^^). We 
denote here by {St)t>o the Euler scheme with step ^ of {St)t£[o,T] defined by 



Sk+i := Sk + a\ —Zk+i, 5o = So, Zk+i ^ M{0,1), k>0, 
V m 

and we approximate At{S, S) by Ar(5, 5) = YlT=o e~''^^''~^°~^^\ The market impact penalization 

function is ^{x) = (1 + ri{x))x with ri{x) = A'e^'^. Now we present the cost function and its 
derivative for the following parameters: 

• parameters of the asset dynamics: sq = 100 and a = 0.01, 

• parameters of the intensity of the execution process: A = 5 and k = 1, 

• parameters of the execution: T = 5 and Q = 10, 

• parameters of the penalization function: k = 1, A' = 0.1 and k' = 0.05. 

We use m = 20 for the Euler scheme and M = 10000 Monte Carlo simulations. 



Settin g 1 (?? ^ 0) 



Cost function 



Derivative 




0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 
5 S 

Figure 2: 77 ^ 0: T = 5, A = 5, = 1, so = 100, a = 0.01, Q = 10, k = 6, A' = 1, k' = 0.01, m = 20 
and M = 10000. 



21 



Settin g 2 (r? = 0) 



Cost function 



Derivative 




0.01 0.02 0.03 0.04 0.05 0.06 0.07 
5 



0.04 0.06 
6 



Figure 3: r/ = 0: T = 5, A = 5, A: = 1, sq = 100, a = 0.01, Q = 10, k = 12, m = 20 and M = 10000. 



Remark. When one looks at the cost functions in Figures 2 and 3 (left), one may think that it would 
be simpler to compute the cost functions to derive the minimum and the associated optimal distance. 
But the computing time of the costs is about 100 seconds which is too large compared to the length of 
the posting period T = 5 seconds, whereas that of the stochastic recursive procedure is about 1 second. 

Now we present the results of the stochastic recursive procedure for the two settings with 



n = 100 and 7„ 



Settin g 1 {r] ^ 0) 



stochastic Approximation 




50 100 150 200 250 300 350 400 450 500 

time (s) 



lOOn 



Fair and posting prices 




Figure 4: 77 ^ 0: T = 5, A = 5, k = 1, sq = 100, a = 0.01, Q = W, k = 6, A' = 1, k' = 0.01, m = 20 
and n = 100 
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Settin g 2 {r] = 0) 



Stochastic Approximation 




Fair and posting prices 




U 5U lUU 13U ZUU ZtDU LiUU LiSU 4UU 4bU bUU iiUU tiUU HUU 1 UUU liiUU 14UU 1 bUU \UUU ZUUU 

time (s) time (s) 

Figure 5: r? = 0: T = 5, A = 5, = 1, sq = 100, a = 0.01, Q = 10, k = 12, m = 20 and n = 100 



5.2 Market data 

The self-adaptiveness nature of this recurrence procedure allows to implement it on real data, even if 
they are not exactly fulfilling the model assumptions. In the numerical example of this section, the 
trader reassess his order using the previously exposed recurrence procedure on real data on which the 
parameters of the models (A:, A, k, k' , A') have been beforehand fitted. 

As market data, we use the bid prices of Accor SA (ACCP.PA) of 11/11/2010 for the fair price 
process {St)t£[o,T]- We divide the day into periods of 15 trades which will denote the steps of the 
stochastic procedure. Let A'cycios be the number of these periods. For every n € A'cycies, we have 
a sequence of bid prices (5j^"''*)i<i<i5 and we approximate the jump intensity of the Poisson process 
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i=2 



Arn (,5, S) , where = Ei=i by 
The empirical mean of the intensity function 
is plotted on Figure 6. 



^ -^cycles 

At- (5, 5) 



cycles 



n=l 



Intensity 




Figure 6: Fit of the exponential model on real data (Accor SA (ACCP.PA) 11/11/2010): A = 1/50, 
k = 50 and iVcycles = 220. 
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The penalization function has the fohowing form 

^{x) = (1 + ri{x))x with r]{x) = A'e'^'''. 

Now we present the cost function and its derivative for the fohowing parameter values: A 
k = 50,Q = 100, A' = 0.001 and k' = 0.0005. 



1/50, 



Setting!, [t] ^ 0) 



Cost function 



Derivative 




Figure 7: ?? ^ 0: ^ = 1/50, A; = 50, Q = 100, k = 1, A' = 0.001, k' = 0.0005 and iVcycics = 220. 
Setting2_{r] = 0) 



Cost function 



Derivative 




Figure 8: ?? = 0: A= 1/50, A; = 50, Q = 100, k = 1.001 and iVcycies = 220. 

Now we present the results of the stochastic recursive procedure for two cases. To smoothen the 
behavior of the stochastic algorithm, we use Ruppert and Poliak's averaging principle (see [8]). In 
short, this principle is two-folded: 

- Phase 1: Implement the original zero search procedure with 7™ = ^ < P < 1) 7i > 0, 

- Phase 2: Compute the arithmetic mean at each step n of all the past values of the procedure, 
namely 



1 



n > 1. 



fc=0 
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It has been shown by several authors that this procedure under appropriate assumptions is ruled by 
a CLT having a minimal asymptotic variance (among recursive procedures). 



Settin g 1 {rj ^ 0) 



Stochastic Approximation 




Fair and Posting Prices 




lime (s) 



Figure 9: ?? ^ 0: A = 1/50, A; = 50, Q = 100, k = I, A' = 0.001, k' = 0.0005 and iVcycies = 220. Crude 
algorithm with 7„ — ^ 



550n ■ 
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Settin g 2 (r? = 0) 



stochastic Approximation 

r~ 



Fair and Posting Prices 




1 1.5 

time (s) 



Figure 11: r? 0: A = 1/50, = 50, Q = 100, k = 1.001, 7„ = and iVcydns = 220. 

Stochastic Approximation 



450n "-^^^ cycles 
Fair and Posting Prices 




time (s) 



Figure 12: r/ ^ 0: A = 1/50, /c = 50, Q = 100, k = 1.001 and iVcycies = 220. Ruppert and Poliak's 
averaging algorithm with 7„ = ^gg^o.as ■ 



We see on Figures 10 (for r/ ^ 0) and 12 (for = 0) that the recursive procedures converge toward 
their respective targets, namely the minimum of the execution cost functions presented in Figures 7 
left (for ^ 0) and 8 left (for = 0). 



5.3 Comparisons and comment on the efficiency of the reassessment rules 

Practically, the stochastic algorithm defined by the dynamics of equation (2.4) has to be read as a 
reassessment rule to apply to the distance to the reference price (which can be taken as the best 
opposite) at a given frequency (that can be expressed in calendar time or business time, i.e. number 
of trades or traded quantities): each cycle (of given 5 seconds), the trading algorithm updates 5n and 
modifies the price of its limit order to be the best opposite price minus (5„ (for a buy order). 

The change from 5n-i to J„ is computed using the expression (3.14) for H, and a typical choice 
for the step \s = lo/nP (0 < p < 1). 

Comparing the results obtained on simulated data (subsection 5.1) to the ones obtained on real 
data (subsection 5.2), the first point to notice is that the convergence is ten times slower on real data 
than on simulated ones (less than 10 minutes on real data vs less than one minute on simulated ones). 

The second element is that once the algorithm succeeded in being close to the optimal value 6* , 
it oscillates around it without any need to wait again from one to ten minutes to converge. It means 
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that using a heuristic rule to choose 6q rather than a random value can practically avoid "paying" this 
first step. 

Another important element to highlight is that the posting price dynamics on simulated data 
are really different than the ones on real data. On simulated data, the price evolution diagrams are 
unrealistic for any trader. It mainly comes from the fact that the price does not behave like a Brownian 
motion at a very short time scale. On real data, the posting price adapts to it, being also far more 
realistic by following the last prices more closely than in a simulated environment. 

It shows the adaptiveness property of a stochastic algorithm, that may need more time to catch a 
reasonable value for 6n, but then keep very close to it, despite the price moves that are less smooth 
than a classical diffusion. 

Going back to the expression of H, it must be qualitatively said that: 

• the first term -Qt^^^^ (iV > Qt) will push the price away when the order has been completely 
filled before the end of the reload period T; 

• the second term (^At(5, S){So -6)- At{5, S)) F^^'> {N^' < Qt - I) will attract the price to the 
best opposite as far as the price move between St and St+r does not change the intensity of the 
"order flow" A{6n, St+r) it will be exposed to; 

• the last component of the reassessment policy —kSt^A.t{S, S)(p^^\fi) attracts the price to the 
best opposite when the "price impact cost" (i.e. coming into the equation via K(p^^\ which can 
bee seen as the first derivative of the market impact k<I>) to pay when the expected fill rate has 
not been obtain, is too high. 

This qualitative interpretation of the update rule obtained by rigorous derivation of criteria (2.2) 
shows how to estimate and optimally mix these three natural effects that a trader would like to see 
into any price reassessment policy. It shows the value of a stochastic algorithm approach can bring to 
optimal trading at every time scale: (1) clearly links an objective criteria to the reassessment policies, 
(2) exposing the assumptions needed to guarantee its convergence. It is clearly the risk-control role 
that one can expect from a formalized approach of trading. 

6 Conclusion 

This paper presents a rigorous proof of convergence of a reassessment scheme for a trading tactic 
aiming at capturing liquidity around the book^\ It implements a learning by trading approach, 
validated inside our class of model (which is quite general) : 

• the distance to a reference point (in practice it can be the best opposite, the mid point, or any 
efficient price estimate) is fixed at 5„ during few seconds or market trades, 

• the tactic observes the market feedback resulting from the combination of the natural diffusion 
of the reference price and a point process filling my order with an intensity depending on my 
instantaneous distance to the reference price (which varies), 

• our formal results give the optimal way to adjust the distance 6n+i to the reference price, given 
then marginal variations of the different market components which can be anticipated to an 
increase or decrease of posting distance. 
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The robustness of the approach is not only guaranteed theoretically (provided that the market 
impact is in a realistic range, as commented on section 2.2.1), but also confirmed and emphasized 
by tests carried out on real data (partially reproduced in Section 5.2). This benchmark shows that 
even if the real data behave differently from Monte-Carlo generated scenarios (see Section 5.1), the 
convergence still occurs. 

This paper strongly suggests that such iterative trading procedures, very often used by practitioners 
because of the way it can be efficiently fitted on line to real time data and providing optimal reassess- 
ment rules. This has to be compared to a stochastic control approach which needs to be calibrated 
on using data on longer time frames inducing somehow an "averaging" effect of the instantaneous 
liquidity effects that can occur on the real markets. 

With the recent modifications of market microstructure following fragmentation of markets and 
emergence of high frequency trading, it is clear that algorithmic traders will need to devise more 
reactive and short term tactics. Covering this aspect of trading, this paper opens the door to other 
researches (like multi trading pools and multi asset reassessment of limit prices), and to applications 
for practitioners. Traders can use such scheme as sub-tactics of a brokerage algorithm, of an high 
frequency market making mechanism, or of any intraday arbitrage automated process. 
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Appendix 



A Convergence theorem for constrained algorithms 



The aim is to determine an element of the set {0 € G : h{6) = E [H{9, Y)] = 0} (zeros of h in Q) 
where 6 C M"^ is a closed convex set, /i : M'^ — > R*^ and H : x ^ W^. For 6*0 € 9, we consider 
the M'^- valued sequence {6n)n>o defined by 



On+i = Proje {9n - 7„+iF(6'„,y„+i)) , 



(A.22) 



where (l^)n>i is an i-i-d. sequence with the same law as Y, (7n)n>i is a positive sequence of real 
numbers and Projg denotes the Euclidean projection on 0. The recursive procedure (A.22) can be 
rewritten as follows 

0n+l =0n- Jn+lh{9n) " 7n+lAM„+i + 7„+ip„+i, (A.23) 

where AM„+i = H{9n, Yn+i) - h{9 ^) is a martingale increment and 



1 



Pn+l 



Proje i^n - -in+lH{9n,Yn+l)) 



ln+1 



+ H{9n,Yn+l). 



Theorem A.l. (see [18] and [19]) Let {9n)n>o be the sequence defined by (A.23). Assume that there 
exists a unique 9* & @ such that h{9*) = and that the mean function satisfies on Q the following 
mean-reverting property, namely 



y9^9*G@, {h{9)\9-9*) >0. 
Assume that the gain parameter sequence {'Jn)n>i satisfies 

^7,i = +oo and ^7^<+cx). 



n>l 



H{9,Y)f <K{l + \9\^), 



(A.24) 



(A.25) 



(A.26) 



n>l 

// the function H satisfies 

3K>{) such that V6I G 9, E 
then 

a a.s. 

Ufi r o . 

Remark. If B is bounded (A.26) reads sup^ggE \H{9^Y)^ < +oo, which is always satisfied if is 



compact and i— ?• E 



\Hi9,Y)\' 



is continuous. 



29 



B Functional co-monotony principle for a class of one-dimensional 
diffusions 



In this section, we present the principle of co-monotony, first for random vectors taking values in a 
nonempty interval /, then for one-dimensional diffusions lying in I. 

B.l Case of random variables and random vectors 

First we recall a classical result for random variables. 

Proposition B.l. Let f,g : I C M ^ M be two monotonic functions with same monotony. Let 
X : {U,A,F) L be a real valued random variable such that f{X),g{X) G L^(P). Then 

Cov(/(X),5(X)) >0. 

Proof. Let X, Y be two independent random variables defined on the same probability space with 
the same distribution ¥x- Then 

{f{X)-f{Y)MX)-giY))>0 
hence its expectation is non-negative too. Consequently 

E [f{X)giX)] - E [f{X)giY)] - E [f{Y)g{X)] + E [f{Y)g{Y)] > 

so, using that Y = X and Y, X are independent, yields 

2E [f{X)g{X)] > E [f{X)] E [g{Y)] + E [f{Y)] E [g{X)] = 2E [f{X)] E [g{X)] 

that is Cov{f{X),g{X)) > 0. □ 

Proposition B.2. Let F, G : M.'^ ^ M. be two monotonic functions with same monotony in each of their 
variables, i.e. for every i € {1, . . . , d}, xi i — > F{xi, . . . , Xj, . . . , Xd) and Xi i — > G{xi, . . . ,Xi, . . . , x^) 
are monotonic with the same monotony which may depend on i (but does not depend on (xi, . . . 
Xj+i, . . . , Xrf) G W^^"^). Let Xi, . . . ,Xd be independent real valued random variables defined on a 
probability space (J7,^,P) such that F(Xi, . . . ,Xd),G(Xi, . . . G L'^{¥). Then 

CoY{F{Xi,...,Xd),G{Xi,...,Xa))>0. 

Proof. The proof of the above proposition is made by induction on d. The case d = 1 is given by 
Proposition B.l. We give here the proof for d = 2 for notational convenience, but the general case of 
dimension d follows likewise. By the monotonic assumption on F and G, we have for every X2 G M, if 
X[ = Xi with X[, Xi independent, that 

{F{Xi,X2)-F{X[,X2)) {G{Xi,X2)-G{X[,X2)) > 0. 
This implies that (see Proposition B.l) 

Cov(F(Xi,X2)G(Xi,X2)) > 0. 

If Xi and X2 are independent, using Fubini's Theorem and what precedes, we have 
E[F(Xi,X2)G(Xi,X2)] = / Px2(dx2)E[F(Xi,X2)G(Xi,X2)] 

> / ¥x,{dx2)^[F{Xi,X2)]^[G{Xi,X2)]. 
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By setting ip{x2) = E [F{Xi,X2)] and Tp{x2) = E [G{Xi,X2)] and using the monotonic assumptions on 
F and G, we have that ip and il^ are monotonic with the same monotony so that 



Fx,idx2)K [FiXuX2)] E [GiXi,X2)] = E MX2)^{X2)] > E MX2)] E [^{X2)] . 
Combining these above two inequahties finahy yields Gov {F{Xi, X2)G{Xi, X2)) > 0. □ 
B.2 Case of (one-dimensional) diffusions 

This framework corresponds to the infinite dimensional case and we can not apply straightforwardly 
the result of Proposition B.l: indeed, if we define the following natural order relation on D([0,T],M) 

Vai,Q2 € B([0,T],M), ai < a2 ^ {Vt € [0,T], ai{t) < a2{t)) , 

this order is partial which makes the formal proof of Proposition B.l collapse. To establish a co- 
monotony principle for diffusions, we proceed in two steps: first, we use the Lamperti transform to 
"force" the diffusion coefficient to be equal to 1 and we establish the co-monotony principle for this 
class of diffusions. Then by the inverse Lamperti transform, we go back to the original process. 

In this section, we first present our framework in more details. Then we recall some weak conver- 
gence results for diffusion with diffusion coefficient equal to 1. Afterwards we present the Lamperti 
transform and we conclude by the general co-monotony principle. 

Let I be a nonempty open interval of M. One considers a real-valued Brownian diffusion process 

dXt = b{t,Xt)dt + a{t,Xt)dWt, Xq = xq G I, t G [0,r], (B.27) 

where 6, a : [0, T] x / — > M are Borel functions with at most linear growth such that the above 
Equation (B.27) admits at least one (weak) solution over [0, T] and is a Brownian motion defined 
on a probability space {il,A,f). We assume that the diffusion X a.s. does not explode and lives in 
the interval /. This implies assumptions on the function b and a especially in the neighborhood (in 
/) of the endpoints of / that we will not detail here. At a finite endpoint of /, these assumptions are 
strongly connected with the Feller classification for which we refer to [17] (with a{t, •) > for every 
t G [0,r]). We will simply make the classical linear growth assumption on b and a (which prevents 
explosion at a finite time) that will be used for different purpose in what follows. 

To "remove" the diffusion coefficient of the diffusion X, we will introduce the so-called Lamperti 
transform which requires additional assumptions on the drift b and the diffusion coefficient a, namely 



(Aa) 



(i) a eC\[0,T]x 

(u) V(t, x) G [0, T] X /, |6(t, x)\ < C(l + |x|) and < a{t, x) < C(l + |x| 



{iii) Vx G /, I(-oo,x]nl a{t,^) ~ f[x,+oo,)nl a{t,^) 



^ - r A - +00 



(B.28) 



Remark. Condition (iii) clearly does not depend on x G /. Furthermore, if / = M, (iii) follows from 
(ii) since > 

Before passing to a short background on the Lamperti transform which will lead to the new 
diffusion deduced from (B.27) whose diffusion coefficient is equal to 1, we need to recall (and adapt) 
some background on solution and discretization of such SDE. 
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B.2.1 Background on diffusions with a = 1 (weak solution, discretization). 

The following proposition gives a condition on the drift for the existence and the uniqueness of a weak 
solution of a SDE when a = 1 (see [16] Proposition 3.6, Chap. 5, p. 303 and Corollary 3.11, Chap. 5, 
p. 305). 

Proposition B.3. Consider the stochastic differential equation 

dYt = /3{t,Yt)dt + dWt, t E [0,r], (B.29) 

where T is a fixed positive number, W is a one- dimensional Brownian motion and f3 : [0, T] x M — > M 
is a Borel-measurable function satisfying 

\f3{t,y)\<K{l + \y\), t e [0,T], y G M, K > 0. 

For any probability measure u on (M, ;B(M)), equation (B.29) has a weak solution with initial distribu- 
tion V . 

If, furthermore, the drift term (3 satisfies one of the following conditions: 
(i) f3 is bounded on [0, T] x M, 

(ii) 13 is continuous, locally Lipschitz in y € M uniformly in t & [0)^]? 
then this weak solution is unique (in fact (ii) is a strong uniqueness assumption). 

Now we introduce the stepwise constant (Brownian) Euler scheme y™ = ( Ykr I with step 

V m/0<fc<m 

^ of the process Y = (yt)tg[o,T] defined by (B.29). It is defined by 

Yt^^,=yt^+m,Yt^)- + \ -Uk+i, Yo = Yo = yo, = 0, . . . , m - 1, (B.30) 
fc+i k m \ m 

where = k = 0, . . . ,m, and {Uk)o<k<m denotes a sequence of i.i.d. 7\A(0, l)-distributed random 
variables given by 

Uk = ^{w,^-W,^_;), k = l,...,m. 

The following theorem gives a weak convergence result for the stepwise constant Euler scheme (B.30). 
Its proof is a straightforward consequence of the functional limit theorems for semi-martingales (to be 
precise Theorem 3.39, Chap. IX, p. 551 in [15]). 

Theorem B.l. Let (3 : [0,T] x R ^ M be a continuous function satisfying 

3K>0, |/3(t,y)| <i^(l + |y|), t G [0,T], y € M. 

Assume that the weak solution of equation (B.29) is unique. Then, the stepwise constant Euler scheme 
of (B.29) with step ^ satisfies 

ym — ^ Y for the Skorokhod topology as m —)• oo. 

In particular, for every functional F : B([0, T],M) — > M Py-a.s. continuous at a G C([0,T],]R), with 
polynomial growth, we have 

EF(y"') — > EF(y) 

(by uniform integrability since sup^gp^T] \Y^\ € CIp^qL^)- 
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B.2.2 Background on the Lamperti transform 

We will introduce a new diffusion Yt := L(t, Xt) which will satisfy a new SDE whose diffusion coefficient 
will be constant equal to 1. This function L defined on [0,T] x 7 is known in the literature as the 
Lamperti transform. It is defined for every {t, x) G [0, T] x 7 by 



L{t,x):= r (B.31) 



where Xi is an arbitrary fixed value lying in /. The Lamperti transform clearly depends on the choice 
of Xi in I but not its properties of interest. First, under {Ab,a)-{i)-{ii), L G C^'^([0,r] x I) with 

dL, , r 1 da , ,^ dL ^ ^ 1 d'^L , ^ 1 da , ^ 

Let t G [0,T], L{t,-) is an increasing C^-diffeomorphism from I onto M = L{t,I) (the last claim 
follows from (At, a) -{Hi))- Its inverse will be denoted L~^{t, ■). 

Notice that, {t,y) i-> L~^{t,y) is continuous on [0,r] x I since both sets 

{{t,y) e[0,T]xI : L-\t,y) < c} = G [0,r] x M : L(t,c) > y} 

and 

{{t,y) G [0,r] X 7 : L-\t,y) > c} = {{t,y) G [0,T] x M : L{t,c) < y} 
are both closed for every c G M. Therefore, if {Ab,a) holds, the function /3 : [0, T] x 7 R defined by 



Kt,y) :-- 



is a Borel function, continuous as soon as b is. 

Now, we set ViG[0,T], Yt := L{t, Xt). 

ltd formula straightforwardly yields 

dYt = P{t, Yt)dt + dWu Yq = L(0, xo) =: yo e M. (B.33) 
Remarks. • In the homogeneous case, which is the most important case for our applications, 

dXt = h{Xt)dt + a{Xt)dWt, = a;o G M, t G [0, T], (B.34) 

we have 

L{t,x) = L{x) := r ^ 

Jxi 

Then by setting Yt := L{Xt), we obtain 

dYt = P{Yt)dt + dWt, Yq = L{xo) =: yo with ^ := (^^ - o L-\ 
Note that B is bounded as soon as - — ^ is. 

• If the partial derivative 6^ exists on [0, T] x 7, one easily checks, using (L~^)^(t, y) = a{t, L~^{t, y)), 
that for every (t, y) G [0, T] x 7, 

fi%y) = - - "-^){t,L-\t,y)). (B.35) 
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As a consequence, as soon as the function 



^ (T 2 



is bounded on [0, T] x /, tlicn /3 satisfies the linear growth 
< Lipschitz assumption (B.36) 
> 0, then P is non-decreasing. 

Definition B.l. The functional Lamperti transform, denoted A, is a functional from C{[0,T],I) to 
C([0,T],M) defined by 

Va G C([0,r],/), A(a) = L(-,a(-)). 

Proposition B.4. // the diffusion coefficient a satisfies {Ab,a), the functional Lamperti transform is 
an homeomorphism from C([0, T],I) onto C([0, T], M). 

Proof. Let a G C([0, T],/). Since a is bounded away from on the compact set [0,T] x q:([0,T]), 
standard arguments based on Lebesgue domination theorem, imply that A(q:) G C([0,r],R). 
Conversely, as L(t, •):/—>■ M is an homeomorphism for every t G [0, T], A admits an inverse defined 
by 

VeGC([0,T],M), A-\0-={t^L-\t,at))) eCi[0,T],I). 
Let Uk denote the topology of the convergence on compact sets of / on C([0,T],7). 

[> UK-Continuity of A on [0,T] x /; If a„ — ^ aoo, the set K = [0,T] x UneN "'^d^' -^]) ^ compact 
set included in /. Hence a is bounded away from on so that 

G [0,r], \L{t,anit))-L{t,a^it))\ < ^J— |a„(t) - aoo(i)| 

mix 

i-e. l|A(an) - A(aoo)||oo < — ll^n - aoolloo- 

miK cr 

> Uk -Continuity of A~^ on [0,T] x /; by using {Ab,a-)-{'i'i), we have for a fixed t G [0,T], 

Vx,x'G/, \Lit,x)-L{t,x')\>-^j^^^^ _^ = _|$(x)-$(a;0|, 

where = sign(2;) log(l + \z\). Thus, 

yy,y' G R, \^{L-\t,y)) - ^L-\t,y'))\ <C\y-y'\. 

Let (^n)n>i be a sequence of functions of D([0,r],M) such that ^„ ^ G C([0,T],M). Then, for 

every t G [0,T] and n > 1, 

|$(L-i(t,en(0)) - ^(^-'(i,o))| <c\ut)\ <ci\\Ut) - eil + UW) + \H^o)\ < c, 

since L~^(t, 0) = xq. Consequently, for every t G [0,T] and every n > 1, L~^{t,£^n{t)) G i^' := 
^>~^([— C, C"]). The set K' is compact (because the function $ is continuous and proper (lim|2,|^oo \ '^{^)\ 
+00)). As uiiK' > 0, we deduce that there exists ryo > such that 

yx,yel, \^{x) - ^{y)\ > r]o\x - y\, 

i.e. 

yte[0,T], yu,v e L{t,I), \L-^{t,u)-L-^{t,v)\<C"\u-v\, C" > 0. 
Hence, one concludes that 

||A-l(^„) - A-l(^oo)||oo < C"Un - eoo)|U. □ 
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B.2.3 Functional co-monotony principle for diffusion 

Definition B.2. The diffusion process (B.27) is admissible if {Ah^a) holds and 

(i) for every starting value xq G /, (B.27) has a unique weak solution which lives in I up to t = +00 
(see Proposition B.3 for a criteria), 

(a) the function (3 defined by 

is continuous on [0, T] x M, non- decreasing in y for every t S [0, T] or Lipschitz in y uniformly 
in t € [0,T], and satisfies 

3K>0 such that \/3{t,y)\ < K{1 + \y\), t e [0,T], y e R. 

Definition B.3. Let F : B{[0,T],R) be a functional. 

(i) The functional F is non- decreasing (resp. non-increasing) on D{[0,T],R) if 

Vai,a2 G ]D)([0,r],R), (Vt G [0,T], ai{t) < a2{t)) =^ F{ai) < ^(02) (resp. F{ai) > F{a2)). 

(ii) The functional F is continuous at a ^ C([0, T],]R) if 

Va„ GB([0,r],M), AaGC([0,r],M), F(a„) ^ F(a). 

where U denotes the uniform convergence of functions on [0, T]. The functional F is C- 
continuous if it is continuous at every a G C([0,T],M). 

(Hi) The functional F has polynomial growth if there exists a positive real number r > such that 

Vq gB([0,T],M), |F(q)| < i^(l + ||a||^). (B.37) 

Remark. Any C-continuous functional in the above sense is in particular ¥z-a.s. continuous for every 
process Z with continuous paths. 

Definition B.4. A process (^t)tg[o,T] with continuous (resp. cadlag stepwise constant j paths defined 
on (ri,^,P) satisfies a functional co-monotony principle if for every C-continuous functional (resp. 
measurable functionals on B([0, T], M)j F, G monotonic with the same monotony satisfying (B.37) such 
that F{X), G{X) and F{X)G{X) G L^ , we have 

Cov [F{{Xt) 

tG[o,T] ) ' G{ {Xt)^^^Q^^ ) ) ^ 0- 
The main result of this section is the following 

Theorem B.2. Assume that the real-valued diffusion process (B.27) is admissible (see Definition B.2). 
Then it satisfies a co-monotony principle. 

Corollary B.l. Assume that the real-valued diffusion process (B.27) is admissible (see Defintion B.2). 

(a) Let (^t™)o<fc<m stepwise constant Euler scheme with step ^ (f^ = ^ , < k < m). Then 
(X^m)^^^^^ satisfies a co-monotony principle. 

(b) Let [Xt^. ] be a sample of discrete time observations 0/ (Xt)t(=fo ri f^r subdivision {tk)o<k<m 
of [0,T](0 = to < ■ ■ ■ < tjn = T). Then ( Xt^. I satisfies a co-monotony principle. 

V / 0<k<m 
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Remark. The proof of Corollary B.l is contained in the proof of Theorem B.2. The only difference 
is that we do not need to transfer the co-monotony principle from the Euler scheme to the diffusion 
process. 

Before passing to the proof of Theorem B.2, we need two lemmas: one is a key step to transfer 
co-monotony from the Euler scheme to the diffusion process, the other aims at transferring uniqueness 
property for weak solutions. 

Lemma B.l. For every a G B([0, r],]R), set 

m—l 

aM = «(*r)l[t™,t™ J + «(r)l{T}, ^>l, (B.38) 

fc=0 

with fV^ := —, k = 0, . . . ,m. Then a^"^^ a as m ^ oo. 

If F : I[!)([0,T],M) — M is C-continuous and non- decreasing (resp. non-increasing), then the 
unique function Fm ■ M™""*"^ — )• M satisfying F{a^"^^) = Fm(a(t^), k = 0, ...,m) is continuous and 
non- decreasing (resp. non-increasing) in each of its variables. Furthermore, if F satisfies a polynomial 
growth assumption of the form 

Va G B([0,T],M), |F(a)| < C{1 + \\a\U 

then, for every m > 1, 

\Fm{xo,...,Xm)\<C{l+ max [xfeH 

0<k<m 

with the same real constant C > 0. 

Lemma B.2. Let {S,d), {T,6) be two Polish spaces and let ^ : S ^ T be a continuous injective 
function. Let fi and fj,' be two probability measures on {S,Bor(S)). If fio^^^ = fi' o^^^, then fi = fi' . 



Proof of Lemma B.2. For every Borel set A of S, n{A) = sup{/i(i^), K C A, K compact}. Let 
A G Bor{S) such that ^i{A) / /i'(^). Then there exists a compact set K oi A such that n{K) ^ n'{K). 
But (^{K) is a compact set of S because <^ is continuous, so (<I>(i^)) is a Borel set of S which 
contains K. As $ is injective, ^''^ (^(i^)) = K. Therefore n (^^^ {^{K))) / fi' (^>~i {^{K))). We 
deduce that ;U o ^ ^' o □ 

Proof of Theorem B.2. First we consider the Lamperti transform iYt)t>Q (see (B.31)) of the 
diffusion X solution to (B.29) with Xq = xq G I. Using the homeomorphism property of A and calling 
upon the above Lemma B.2 with A~^ and A, we see that existence and uniqueness assumptions on 
Equation (B.29) can be transferred to (B.33) since A is a one-to-one mapping between the solutions 
of these two SDE's. 

To fulfill condition {ii) in Definition B.2, we need to introduce the smallest integer, denoted mi^^^, 
such that y ^ y + —^(3(t,y) is non-decreasing in y for every t G [0,T]. Its existence follows from 
Ab^a-{ii) -^ote that if /3 is non-decreasing in y for every t G [0,T], then ruh^a = 1- Then we introduce 
the stepwise constant (Brownian) Euler scheme Y"^ = {YkT^) ^^^^^^ with step ^ (defined by (B.30)) of 
Y = (i^)t6[o,T] with m > mb^„. It is clear by induction on k that there exists for every k G {1, . . . , m} 
a function 0^ : M''+^ R such that 

% = ek{yo,AWt^,...,AWt^) 
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where for (yg, Zi, . . . , z^) € M'^+-'^, 

T 

Qk{yo,zi, ...,zk) = @k-i{yo,zi, . . .,zk~i) + /3(t™„i,GA.._i(7/o,2;i, • • ■,zk-i)) — h zk 

m 

T\ 

id + •)— j oQk-i{yo-,zi,...,zk-i) + Zk. 

Thus for every z G {1, . . . , k}, zi ^ Qkivo, zi, . . . , Zi, . . . , Zk) is non-decreasing because y ^ {y + P{t^_i,i 
is non-decreasing for m large enough, say m > rrih^a- We deduce that if Fm '■ M™"^^ — )• M is non- 
decreasing in each variables, then, for every i S {1, . . . , k}, 

Zi ^ Fm {yo, 0i(yo, 2:1), . . . , Qm{yo, zi,..., Zm)) is non-decreasing. 

By the same reasoning, we deduce that for Gm '■ W"^^^ — ?> M, non-increasing in each variables, we have 
for every z € {1, . . . , k}, 

Zi ^ Gm (yo, 0i(yo, zi),..., 6m (yo, zi,..., Zm)) is non-increasing. 

Let Fm and Gm be the functions defined on M™+^ associated to F and G respectively by Lemma B.l. 
As /3 has linear growth, Y and its Euler scheme have polynomial moments at any order p > 0. Then 
we can apply Proposition B.2 to deduce that 



E[FG(F'^)] = E 

> E 



Fm Y,t] ]Gm Y, 



m J 0<k<m I \\ m J 0<k<m 



Fm I ( , 

m J 0<k<m 









E 


Gm ^ 






\ m / 0<k<m J 



E [F (y™)] E [G (y™)] 



Note that if F and G are C-continuous with polynomial growth, so is FG. We derive from 
Theorem B.l that 

ElFGiY"")] EFG{Y), E [F (F'")] EF{Y), ErCfF™)] EG{Y), 

therefore 

Coy {F{Y),G{Y)) > 0. 

To conclude the proof, we need to go back to the process X by using the inverse Lamperti transform. 
Indeed, for every t G [0,r], Xt = L-^{t,Yt), where Y satisfies (B.33). Let F : B([0,r],R) ^ R C- 
continuous. Set 

Vq e C([0,r],M), F{a) := F (^{L-\t,at))^^^^^^^) . 

Assume first that F and G are bounded. The functional F is C-continuous owing to Proposition B.4, 
non-decreasing (resp. non-increasing) since L~^{t,.) is for every t £ [0, T] and is bounded. Conse- 
quently, 

Gov {F{X), G{X)) = Gov (f{Y),G{Y)) > 0. 



To conclude we approximate F and G in a robust way with respect to the "constraints" , by a canonical 
truncation procedure, say 

:= max ('(-M),min (F,M)), Af € N. 



If F and G have polynomial growth, it is clear that Cov {Fm {X) , G m {X)) — )• Gov (F(X), G(X)) as 
M ^ 00. □ 
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Examples of admissible diffusions. • The Bachelier model: This simply means that = ^t+aWt, 
fj > 0, clearly fulfills the assumptions of Theorem B.2. 

• The Black-Scholes model: The diffusion process X is a geometric Brownian motion, solution to 
the SDE 

dXt = rXtdt + ^XtdWt, Xq = xq > 0, 

where r € M and > are real numbers. The geometric Brownian motion lives in the open interval 
I = (0, +oo) and j5{y) = § — f is constant. One checks that L{x) = ^ log (^^) where xi e (0, +oo) 
is fixed. 

• The Hull-White model: It is an elementary improvement of the Black-Scholes model where 
-i? : [0, T] (0, +oo) is a deterministic positive function i.e. the diffusion process X is a geometric 
Brownian motion solution of the SDE 

dXt = rXtdt + {}{t)XtdWt, Xq = xq > 0. 

Then, elementary stochastic calculus shows that 

where {Bu)u>o is a standard Brownian motion (the second equality follows form the Dambins-Dubins- 
Schwarz theorem). 

Consequently Xt = -^j' ^2(^)^5) where the functional ^ ^ ^ j ^'^{s)ds^^^ defined 



on D([0,T^],R), T^ = /q 'd'^{t)dt, is C-continuous on C([0,T^],M). Hence for any C-continuous M- 
functional on D([0,r],M), the M- valued functional F defined by F(^) = F(^^p{t,i(^ j ?9^(s)ds^^^ is 
C-continuous on D([0, T^], M). Then, on can transfer the co-monotony property from B to X. 

• Local volatility model (elliptic case): More generally, it applies still with / = (0, +00) to some 
usual extensions like the models with local volatility 



dXt = rXtdt + 'd{Xt)XtdWu Xo = xo> 0, 



where -i? : M — )• {'dQ,+oo), i?o > 0, is a bounded, twice differentiable function satisfying < *^ 



and W"{x)\ < x G (0,+cxd). 

In this case / = (0, +00) and, xi € / being fixed, one has for every x G I, 



L{x) 



^' di 



which clearly defines an increasing homeomorphism from / onto M since i? is bounded. 

Furthermore, one easily derives from the explicit form (B.35) and the condition (B.36) that f3 is 
Lipschitz as soon as the function 

■!?' x'^M"(x) 

X 1-^ rx—(x) H h x'&'&'(x) is bounded on (0, 00) 

w 2 

which easily follows from the assumptions made on 
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Extension to other classes of diffusions and models. This general approach does not embody 
all situations: thus the true CEV model does not fulfill the above assumptions. The CEV model is a 
diffusion process X following the SDE 

dXt = rXtdt + ^X^dWt, Xq = xq, 

where i9 > and < a < 1 are real numbers. 

So this CEV model, for which I = (0, +oo), does not fulfill Definition ^f„(T-(^^0- a consequence 
L{t, I) 7^ M is an open interval (depending on the choice of xi. To be precise, if xi £ (0, +oo) is fixed. 



so that, if we set 



L defines an homeomorphism from / = (0, +00) onto Jx^. Finally the function /3 defined by 

T -1 (yij 1 

/3(y) = -a)y + x|-") , yeJxi 

^^^^ ^ 1 ; 2 _a)y + x}"")' ^ ' 

is non-decreasing with linear growth at +00. Now, tracing the lines of the above proof, in particular 
establishing weak existence and uniqueness of the solution of the SDE (B.29) in that setting, leads to 
the same positive conclusion concerning the covariance inequalities for co-monotonic or anti-monotonic 
functionals. 
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